Benchmarking Spatial Join Operations with Spatial Output
نویسندگان
چکیده
The spatial join operation is benchmarked using variants of well-known spatial data structures such as the R-tree, R -tree, R-tree, and the PMR quadtree. The focus is on a spatial join with spatial output because the result of the spatial join frequently serves as input to subsequent spatial operations (i.e., a cascaded spatial join as would be common in a spatial spreadsheet). Thus, in addition to the time required to perform the spatial join itself (whose output is not always required to be spatial), the time to build the spatial data structure also plays an important role in the benchmark. The studied quantities are the time to build the data structure and the time to do the spatial join in an application domain consisting of planar line segment data. Experiments reveal that spatial data structures based on a disjoint decomposition of space and bounding boxes (i.e., the R-tree and the PMR quadtree with bounding boxes) outperform the other structures that are based upon a non-disjoint decomposition (i.e., the R-tree and R -tree). As the size of the output of the spatial join increases with respect to the larger of the two inputs, the advantage of the bounding boxes used in methods based on a disjoint non-regular decomposition is no longer a factor and methods based on a disjoint regular decomposition (i.e., the PMR quadtree regardless of the presence of bounding boxes) perform signi cantly better. When the output of the spatial join is not required to be spatial (i.e., it is a list or table of tuples), then the performance of the best methods based on a non-disjoint decomposition (i.e., the R -tree) is comparable to those with a disjoint decomposition (i.e., the R-tree and the PMR quadtree with and without bounding boxes) as long as the size of the output is considerably smaller than that of the larger of the two inputs (e.g., 10%). However, as the output gets larger, the methods based on a regular decomposition (i.e., the PMR quadtree with and without bounding boxes) are much better than those based on an irregular decomposition (i.e, the R -tree and the R-tree) which have comparable performance in this case regardless of whether or not they are based on a disjoint or non-disjoint decomposition of space.
منابع مشابه
Benchmarking Spatial Joins à la carte
Spatial joins are join operations that involve spatial data types and operators. Spatial access methods are often used to speed up the computation of spatial joins. This paper addresses the issue of benchmarking spatial join operations. For this purpose, we first present a WWW-based benchmark generator to produce sets of rectangles. Using a Web browser, experimenters can specify the number of r...
متن کاملAccelerating Spatial Join Operations using Bit-Indices
Spatial join is a very expensive operation in spatial databases. In this paper, we propose an innovative method for accelerating spatial join operations using Spatial Join Bitmap (SJB) indices. The SJB indices are used to keep track of intersecting entities in the joining data sets. We provide algorithms for constructing SJB indices and for maintaining the SJB indices when the data sets are upd...
متن کاملDistance-Associated Join Indices for Spatial Range Search
Spatial join indices are join indices constructed for spatial objects. Similar to join indices for relational database systems, spatial join indices improve efficiency of spatial join operations. In this paper, a distanceassociated join index structure is developed to speed up spatial queries especially for spatial range queries. Three distance-associated join indexing mechanisms: basic, ring-s...
متن کاملSpatial Join Processing Using Corner Transformation
ÐSpatial join finds pairs of spatial objects having a specific spatial relationship in spatial database systems. Since spatial join is a fairly expensive operation, we need an efficient algorithm taking advantage of the characteristics of available spatial access methods. In this paper, we propose a spatial join algorithm using corner transformation and show its excellence through experiments. ...
متن کاملSpatial Join with R-Tree on Graphics Processing Units
Spatial operations such as spatial join combine two objects on spatial predicates. It is different from relational join because objects have multi dimensions and spatial join consumes large execution time. Recently, many researches tried to find methods to improve the execution time. Parallel spatial join is one method to improve the execution time. Comparison between objects can be done in par...
متن کامل